Comparability of Computerized Adaptive and Paper-Pencil Tests

Authors

  • Hong Wang
  • Chingwei David Shin
Abstract

When a traditional Paper-Pencil Test (PPT) is delivered by computer, two types of computerization can be implemented. One is a linear Computer-Based Test (CBT), in which the paper version of the test is presented and administered via computer. In a linear CBT, the items on both versions are generally identical, and the scoring methods and procedures are the same. The change from PPT to CBT therefore involves only a change of administration mode. The other type of computerization is Computerized Adaptive Testing (CAT), in which not only does the medium of administration change from paper to computer, but the test delivery algorithm also changes from linear to adaptive. This adaptive testing paradigm allows test items to be selected and administered so that they are tailored to each test taker’s ability. Therefore, in comparability studies, both the administration mode effect and the paradigm effect on examinees’ performance should be examined to ensure the comparability of the CAT and its PPT counterpart.

Paradigm Effects

The administration mode effect has been widely examined in comparisons of PPT and linear CBT. Although the findings are not conclusive, there seems to be a trend indicating that the two versions are comparable across administration modes (e.g., Paek, 2005; Wang, Jiao, Young, Brooks, & Olson, 2007, 2008). When CAT is compared to its PPT counterpart, the mode effect and the paradigm effect are confounded with each other. To separate the two effects and examine the paradigm effect, some studies have focused on comparability analysis between the linear CBT and CAT.
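The contrast between the linear and adaptive paradigms can be made concrete with a small sketch. The code below is illustrative only and is not taken from the paper: it assumes a Rasch (one-parameter logistic) model, maximum-information item selection, and EAP scoring with a standard normal prior, and all function names (`run_linear`, `run_cat`, and so on) are hypothetical. The same simulated examinee is scored once on a fixed form and once on an adaptively selected set of items from the same pool.

```python
import math


def p_correct(theta, b):
    """Rasch (1PL) probability of a correct response (assumed model)."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))


def item_information(theta, b):
    """Fisher information of a Rasch item with difficulty b at ability theta."""
    p = p_correct(theta, b)
    return p * (1.0 - p)


def eap_estimate(responses):
    """EAP ability estimate on a fixed grid with a standard normal prior.

    responses: list of (difficulty, score) pairs, score being 0 or 1.
    """
    grid = [g / 10.0 for g in range(-40, 41)]
    weights = []
    for theta in grid:
        w = math.exp(-0.5 * theta * theta)  # unnormalized N(0, 1) prior
        for b, u in responses:
            p = p_correct(theta, b)
            w *= p if u == 1 else (1.0 - p)
        weights.append(w)
    return sum(t * w for t, w in zip(grid, weights)) / sum(weights)


def run_linear(form, answer):
    """Linear paradigm: every examinee sees the same items in the same order."""
    responses = [(b, answer(b)) for b in form]
    return eap_estimate(responses), list(form)


def run_cat(pool, answer, test_length):
    """Adaptive paradigm: each item is chosen for the current ability estimate."""
    remaining = list(pool)
    responses = []
    theta = 0.0  # start at the prior mean
    for _ in range(test_length):
        # Select the unused item that is most informative at the current theta.
        b = max(remaining, key=lambda d: item_information(theta, d))
        remaining.remove(b)
        responses.append((b, answer(b)))
        theta = eap_estimate(responses)
    return theta, [b for b, _ in responses]


# Hypothetical item pool (difficulties from -2.0 to 2.0) and a deterministic
# examinee who answers correctly whenever the item difficulty is at most 1.0.
pool = [x / 2.0 for x in range(-4, 5)]


def able_examinee(b):
    return 1 if b <= 1.0 else 0


theta_linear, linear_items = run_linear(pool[:5], able_examinee)
theta_cat, cat_items = run_cat(pool, able_examinee, 5)
```

In this sketch both conditions are computer-delivered, so any difference between `theta_linear` and `theta_cat` reflects only the item-selection paradigm, which mirrors the linear CBT versus CAT comparisons mentioned above.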


Similar resources

Comparability of Computer-based and Paper-based Versions of Writing Section of PET in Iranian EFL Context

Computer technology has provided language testing experts with the opportunity to develop computerized versions of traditional paper-based language tests. New generations of TOEFL and Cambridge IELTS, BULATS, KET, PET are good examples of computer-based language tests. Since this new method of testing introduces new factors into the realm of language assessment (e.g. modes of test delivery, famili...


Examining Differences in Examinee Performance in Paper and Pencil and Computerized Testing

The study evaluated the comparability of two versions of a certification test: a paper-and-pencil test (PPT) and a computer-based test (CBT). An effect size measure known as Cohen’s d and differential item functioning (DIF) analyses were used as measures of comparability at the test and item levels, respectively. Results indicated that the effect sizes were small (d < 0.20) and not statistically s...


Item pool design for computerized adaptive tests

Although item pools are critical to the proper functioning of computerized adaptive tests (CATs), there is very little in the research literature that indicates the desired features of an item pool. Most research articles use an existing item pool that was developed for other purposes. For example, Pastor, Dodd and Chang (2002) used items from NAEP; Wang and Kolen (2001) used an item pool from ...


A Review of Exposure Control Strategies for CAT and Potential Applications in MST

Computerized adaptive testing offers the advantages of more precise and efficient ability estimation and more flexible scheduling for testing when compared with conventional linear tests. Both item-level and testlet-level computerized adaptive tests have proved to be useful alternatives to the conventional paper-and-pencil tests. However, a potential problem is test security. In co...


Online Item Calibration for Q-matrix in CD-CAT

Item replenishment is important to maintaining a large-scale item bank. In this paper we consider calibrating new items based on pre-calibrated operational items under the DINA model, the specification of which includes the so-called Q-matrix, as well as the slipping and guessing parameters. Making use of the maximum likelihood and Bayesian estimators for the latent knowledge states, we propose...




Publication date: 2010